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Background: Colorectal cancer-specific biomarkers have been used as molecular targets for fluorescent intra-operative imaging, 
targeted PET/MRI, and selective cytotoxic drug delivery yet the selection of biomarkers used is rarely evidence-based. We 
evaluated sensitivities and specificites of four of the most commonly used markers: carcinoembryonic antigen (CEA), tumour- 
associated glycoprotein-72 (TAG-72), folate receptor-a (FRa) and Epithelial growth factor receptor (EGFR). 

Methods: Marker expression was evaluated semi-quantitatively in matched mucosal and colorectal cancer tissues from 280 
patients using immunohistochemistry (scores of 0-15). Matched positive and negative lymph nodes from 18 patients were also 
examined. 

Results: Markers were more highly expressed in tumour tissue than in matched normal tissue in 98.8%, 79.0%, 37.1% and 32.8% of 
cases for CEA, TAG-72, FRa and EGFR, respectively. Carcinoembryonic antigen showed the greatest differential expression, with 
tumours scoring a mean of 10.8 points higher than normal tissues (95% CI 10.31-11.21, P<0.001). Similarly, CEA showed the 
greatest differential expression between positive and negative lymph nodes. Receiver operating characteristic analyses showed 
CEA to have the best sensitivity (93.7%) and specificity (96.1%) for colorectal cancer detection. 

Conclusion: Carcinoembryonic antigen has the greatest potential to allow highly specific tumour imaging and drug delivery; 
future translational research should aim to exploit this. 



There is much interest in developing strategies for tumour- specific 
delivery of agents to increase our diagnostic capability or enhance 
the selectivity and effectiveness of therapeutics. These strategies 
rely on the ability to target tumour cells accurately using 
biomarkers that are differentially expressed between tumour and 
normal tissue. In colorectal cancer, the most commonly used 
biomarkers include carcinoembryonic antigen (CEA) (Muguruma 
et al, 1999; Sharkey et al, 2005; Kaushal et al, 2008; Yazaki and 
Kassa et al 2008; Heine et al, 2011), tumour- associated glycopro- 
tein-72 (TAG-72) (Tang et al 2007; Chen et al 2008; Zou et al 
2009; Chen et al, 2011), Epithelial growth factor receptor (EGFR) 
(Goetz et al, 2010; Jeong et al, 2012; Qi et al, 2012), and folate 



receptor-a (FRa) (Reddy and Low, 1998; Chen et al, 2005; Yang 
et al, 2010). Carcinoembryonic antigen and TAG-72 are mem- 
brane-bound glycoproteins that are expressed in over 80% of 
colorectal cancers with relatively little expression in normal 
mucosa (Johnson et al, 1986; Jantscheff et al, 2003). Folate 
receptor-a is a membrane-bound protein that binds and transports 
folic acid and is overexpressed in epithelial -derived cancers, 
including those of the colorectum (Shia et al, 2008). It appears to 
be expressed relatively infrequently in colorectal tumours but is 
absent in the vast majority of normal tissues, hinting at a 
potentially high specificity (Shia et al, 2008). Epithelial growth 
factor receptor is a well-known target for anti- cancer therapies but 
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the literature concerning the degree and frequency of EGFR 
overexpression in colorectal cancer is contradictory, with reported 
expression varying from 33% to 97% of cases (Koretz et al, 1990; 
Lee et al, 2002; Spano et al, 2005; Bhargava et al, 2006). It is also 
reported to be expressed in up to 48% of normal colorectal samples 
(Koretz et al, 1990), which could compromise its specificity for 
cancer tissue. 

Identification of biomarkers that are sensitive and specific for 
colorectal cancer opens up many avenues for diagnostic and 
therapeutic exploitation. The ability to visualise tumours and their 
metastases during surgery may facilitate more accurate and 
disease- specific operations. Fluorescence-based, laparoscopic ima- 
ging, using antibodies targeting antigens found on tumour cells, 
has already shown considerable potential (Mahmood, 2010; 
Tiernan et al, 2012). In non-invasive imaging, iodine- 124 labelled 
antibodies to CEA have been administered to enhance PET cancer 
scanning (Schoffelen et al, 2010; Boerman and Oyen, 2011; 
Carrasquillo et al, 2011; Meller et al, 2011; O'Donoghue et al, 2011; 
Schoffelen et al, 2012), and CEA antibodies have been conjugated 
to iron oxide nanoparticles to improve MRI imaging (Vigour et al, 
2010). In therapeutic applications, radioisotope antibodies, usually 
directed towards CEA or TAG-72, have been used to deliver 
tumour selective radiotherapy (Meredith et al, 1996; Meyer et al, 
2009; de Jong et al, 2011) and have been shown to be particularly 
effective in small-volume disease (Koppe et al, 2005; Barbet et al, 
2012). Similarly, targeted cytotoxic drug delivery has the potential 
to deliver high-dose chemotherapy to cancer cells, while limiting 
damage to normal cells, and has utilised a variety of biomarkers, 
including FRa (Li et al, 2011; Zhang et al, 2012), EGFR (Kopansky 
et al, 2011) and CEA (Conaghan et al, 2008). 

A surprising feature of all the above examples of targeted 
tumour delivery is that the choice of biomarker used is rarely 
justified in terms of its tumour sensitivity and specificity. The 
characteristics of suitable cancer biomarkers are that they must be 
highly expressed in a large majority of tumours and have low or no 
expression in normal tissue, encompassing the normal tissue from 
which the tumour is derived and - ideally - most or all normal 
adult tissues accessible to systemic delivery. Suitable molecules 
must also be located on the external facing surfaces of cancer cells 
so as to be accessible to systemically delivered recognition 
molecules. To our knowledge, expression of the most commonly 
used markers for colorectal cancer has never been rigorously 
evaluated in a well-defined cohort of cancer and normal tissue. We 
aimed to evaluate the sensitivity and specificity of the four most 
commonly used markers for colorectal cancer, CEA, TAG-72, FRa, 
and EGFR, to define which is most suitable for in vivo tumour 
targeting. 



MATERIALS AND METHODS 



Ethical issues and patient cohort. Ethical approval was obtained 
from the National Research Ethics Service ( London -Dulwich 
Committee), reference 12/LO/1327. Formalin-fixed paraffin- 
embedded tissue blocks containing primary colorectal carcinoma 
and matched normal colorectal mucosa, collected prospectively for 
the MRC CLASICC trial (Guillou et al, 2005), were available from 
280 patients. Table 1 shows patient and tumour characteristics. 
Formalin-fixed paraffin-embedded blocks containing mesenteric 
lymph nodes (both tumour cell positive and negative) were 
available for 18 of the patients. 

Antibodies and immunohistochemistry. Antibody suppliers and 
evidence for antibody specificity are described below. A5B7 
humanised anti-CEA antibody was supplied by the Cancer 
Research UK Biotherapeutics Development Unit (Clare Hall 
Laboratories, UK) and has been used in clinical trials of targeted 
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Table 1. Clinico-pathological detai 


s of patient cohort 




Characteristics 


Number of cases (%) n = 280 


Age range: 33-93 years 


Mean age: 68.9 years 


Male 
Female 


144 (51.4) 
136 (48.6) 


pT stage 


1 
2 
3 
4 

Unknown 


10 (3.6) 
54 (19.3) 
161 (57.5) 
52 (18.6) 
3(1.1) 



therapies (Dawson et al, 1991; Meyer et al, 2009). Mouse anti- 
TAG-72 antibody (clone CC49; Santa Cruz, CA, USA) has been 
used widely to target colorectal cancer in radioimmunoguided 
surgery (Sun et al, 2007; Povoski et al, 2012). Mouse anti-EGFR 
antibody (clone 31G7; Invitrogen, Carlsbad, CA, USA) has been 
shown to be highly sensitive in detection of EGFR positive 
colorectal tumours (Buckley and Kakar, 2007). Mouse anti-FR-a 
antibody (clone BN3.2; Leica Biosystems, Newcastle, UK) has been 
validated across a wide range of tissues, including colorectal tissue 
(Smith et al, 2007). Antibodies were optimised for use in 
immunohistochemistry on whole colorectal cancer tissue sections 
to give a broad range of epithelial staining intensities with minimal 
background staining. Tissue microarrays were constructed from 
formalin -fixed paraffin-embedded resection samples of colorectal 
cancer and matched normal colorectal mucosa using 0.6-mm 
cores. For each patient, three cores were selected from the most 
representative tumour area (as determined by haematoxylin and 
eosin staining) and three cores from normal mucosa. Tissue 
microarray blocks or lymph nodes blocks were sectioned onto 
SuperFrost Plus microscope slides (Menzel-Glaser, Braunschweig, 
Germany), dewaxed with xylene and rehydrated through ethanol 
before rinsing in water for 5 min. Epitopes were retrieved either by 
heating slides in pre-warmed 10 mM citric acid buffer (pH 6.0, 
microwave 900 W, 10 min; anti-CEA and anti-TAG-72), using a 
pressure cooker (anti-FRa), or incubating in proteinase K (25 min, 
37 °C; anti-EGFR). Endogenous peroxidase activity was blocked 
with 0.6% hydrogen peroxide for 10 min before rinsing in water for 
5 min. Slides were mounted in Sequenzas (Thermo Scientific, 
Waltham, MA, USA) and rinsed with tris-buffered saline (TBS) 
followed by antibody diluent reagent solution (Invitrogen). 
Primary antibodies were used in diluent reagent at 1 : 100, 
1:150 000, 1:25, 1:50 and 1:50 for anti-TAG-72, anti-CEA, 
anti-FRa, and anti-EGFR, respectively. Incubations were at room 
temperature for 1 h (CEA, TAG-72) or overnight at 4 °C (FRa, 
EGFR). Slides were washed twice with TBS-T (TBS, 10% Tween 
20) and once with TBS before incubating with an appropriate 
horseradish peroxidise conjugated secondary antibody (Dako, 
Glostrup, Denmark) for 30 min at room temperature. The slides 
were washed with TBS-T and TBS. 100 fA of S^'-diaminobenzi- 
dene (Dako) solution was added to each slide for 10 min before a 5- 
min wash in water. Slides were then stained with haematoxylin, 
dehydrated with ethanol and xylene, and mounted with DPX 
Histology mountant (Fluka, St. Louis, MO, USA). 

Scoring and statistical analyses. Slides were digitally scanned 
using Scanscope XT (Aperio, Vista, CA, USA) at 20 x magnifica- 
tion and were observed for scoring using ImageScope vll (Aperio). 
Staining was assessed semi-quantitatively using a bespoke scoring 
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system developed in consultation with two pathologists (ETV and 
NPW). Cores were scored for intensity of epithelial cell staining (0, 
no staining; 1, mild; 2, moderate; 3, strong) and percentage of 
epithelial cells staining positively (0, <5%; 1, 5-20%; 2, 21-40%; 3, 
41-60%; 4, 61-80%; 5, 81-100%). These scores were multiplied to 
give final scores of 0-15. Tumour cores with no tumour cells and 
normal cores with no epithelial cells visible were discounted. 
Carcinoembryonic antigen and TAG- 72 were scored for apical 
membranous staining only. Folate receptor- a showed mild 
cytoplasmic staining only, and was scored solely for this. Epithelial 
growth factor receptor showed membranous, and - more rarely - 
cytoplasmic and nuclear staining. However, following established 
literature (Scartozzi et al 2004; Buckley and Kakar, 2007) EGFR 
was scored for membranous expression only. Two independent 
observers were involved in the scoring: JPT scored all cores and 
ETV scored cores representing 15% of the cases. Scoring 
reproducibility was determined for each antibody using the 
intra-class correlation coefficient; the process of calculating the 
mean score for each patient led to 61 possible outcomes, and 
therefore the data were treated as continuous rather than 
categorical. Scores for each case of tumour or normal tissue were 
means of scores allocated by JPT for each core from that tissue. If 
only one core score was available, for example owing to lack of 
appropriate cell types or core loss, this single score was used; a 
method that has been validated previously (Camp et al 2000; 
Torhorst et al, 2001)). Means for normal and tumour tissue were 
compared using Wilcoxon signed-rank tests for paired non- 
parametric data. Sensitivities and specificities were calculated from 
the mean scores for tumour and normal tissue expression for each 
case using two different cutoff points: the 95th percentile of the 
normal tissue score distribution and optimal cutoffs identified by 
receiver operating characteristic curves (Maraqa et al, 2008). 



RESULTS 



Expression of markers in normal colorectal tissue and tumours. 

We examined expression of CEA, TAG- 72, FRa and EGFR in a 
cohort of 280 colorectal carcinomas and matched normal tissues. 
Clinical and pathological features of this cohort are shown in 
Table 1. Representative images of staining for each antigen in both 
normal and tumour tissues are shown in Figure 1. All four markers 
showed epithelial cell specific staining that was diffuse with some 



luminal accentuation. No staining for any marker was noted in 
blood cells within the sections, suggesting that expression in this 
compartment is negligible. The intra-class correlation coefficient 
was 0.89, demonstrating good agreement between scorers. Both the 
proportion of epithelial cells staining positively and the intensity of 
staining varied widely throughout the cohort in normal and 
tumour tissues (Figure 2). Strongly positive staining in normal 
tissues was relatively uncommon for all antigens, especially for FRa 
(negative in over 99% at the concentrations used). In the tumours, 
by contrast, strongly positive staining was prevalent for CEA and 
TAG-72, and was more common for FRa than in normal tissue 
although tumours were also mostly negative (61%). Epithelial 
growth factor receptor staining in tumours showed a similar range 
of scores to those in normal tissues. Carcinoembryonic antigen, 
TAG-72 and FRa, but not EGFR, were significantly more highly 
expressed in colorectal tumour tissues than in the normal tissues 
(P< 0.001; Table 2). 

Differential expression between matched normal and tumour 
tissue. If markers are to be used for tumour- specific imaging or 
therapeutic delivery, the frequency and the magnitude of increased 
expression in tumours are key parameters. Expression scores were 
higher in tumours as compared with matched normal tissues in 
98.8%, 79.0%, 37.1% and 32.8% of cases for CEA, TAG-72, FRa 
and EGFR, respectively. Although this percentage is relatively low 
for FRa it should be noted that expression was higher in tumours 
as compared with matched normal tissue in almost all cases where 
positive staining was detected in either tissue. However, in the case 
of EGFR, expression was higher in normal tissue (34% of cases) 
more frequently than in tumours (33% of cases), indicating little or 
no bias for tumour- specific expression. We have estimated the 
magnitude of the differences in expression between matched 
tumour and normal tissues as the tumour expression score minus 
the normal expression score (Figure 3). Carcinoembryonic antigen 
demonstrated the greatest difference in expression, with tumours 
scoring on average 10.8 (95% CI 10.31-11.21) points higher than 
normal tissues. This difference was more than twice that of the 
next best marker, TAG-72, which showed tumour expression 5.1 
(95% CI 4.35-5.77) points higher than normal tissue, while FRa 
and EGFR showed only very small increases in expression within 
the tumours. 

Expression of markers in lymph nodes. If markers are to be used 
for imaging or therapeutic targeting of tumour deposits within 



CEA TAG-72 --. EGFR FRa 




Figure 1. Representative tissue microarray cores of normal colorectal tissue (top row) and colorectal tumours (bottom row) showing immuno- 
reactivity as labelled. The normal tissue cores shown demonstrate the median score for that antigen in normal colorectal tissues (CEA= 1; TAG- 
72 = 2; EGFR = 0; FRa = 0). The tumour cores shown demonstrate the median positive score for that antigen in colorectal tumours (CEA= 15; TAG- 
72 =10; EGFR = 2; FRa = 1). Scale bar=100^m. 
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Normal Tumour 




1 Cases 280 1 Cases 280 



Figure 2. Expression levels of CEA, TAG-72, EGFR or FRa in normal 
colorectal tissue (left) and in colorectal tumours (right). Expression 
levels were determined in 280 matched normal and tumour tissues by 
immunohistochemistry using semi-quantitative scores of 0-15. The 
scores for each marker are arranged independently in ascending order 
to demonstrate the distributions across the cohort. 



Table 2. CEA, TAG-72 and FRa are significantly more highly expressed in 
colorectal tumour tissue than matched normal tissue 



Antigen 


Mean normal score 


Mean tumour score 


P- value 


CEA 


1.7 


12.4 


<0.001 


TAG-72 


2.7 


7.8 


<0.001 


FRa 


0.0 


0.7 


<0.001 


EGFR 


0.9 


1.3 


0.08 



Abbreviations: CEA = carcinoembryonic antigen; EGFR = Epithelial growth factor receptor; 
FRa = folate receptor-a; TAG-72 = tumour-associated glycoprotein-72. 




Figure 3. Carcinoembryonic antigen shows the most consistent 
overexpression in tumour tissues and the greatest differential 
expression between matched normal and tumour tissues. Left: 
Expression scores for normal tissues were subtracted from those for 
matched tumour tissues to quantify the degree of tumour 
overexpression for each case. Overexpression scores for each marker 
are arranged in ascending order to demonstrate the distributions 
across the cohort (left). Minus scores reflect cases where tumour 
expression was lower than expression in the matched normal tissue. 
Mean overexpression scores (central marker) with 95% confidence 
intervals (error bars) are also shown (right). 



lymph nodes, marker expression both of tumour cells in tumour- 
cell positive lymph nodes and of normal lymph node tissues are 
critical. We examined expression of the four markers in lymph 
nodes taken during resection from 18 of the patients from our 
cohort. In each case, we examined a node found to contain 
metastatic tumour cells ('positive') and a node that was clear of 
metastatic deposits ('negative') (Figure 4). Expression of all 



Negative Positive 




1 Cases 18 1 Cases 18 



Figure 4. Expression levels of CEA, TAG-72, EGFR or FRa in lymph 
nodes, containing colorectal tumour deposits ('positive'; right) and in 
matched lymph nodes lacking tumour cells ('negative'; left). Expression 
levels were determined by immunohistochemistry using scores of 0-15 
in 18 cases. The scores for each marker are arranged independently in 
ascending order to demonstrate the distributions across the cohort 
(note: datapoints are linked by lines to aid interpretation of the 
distributions not to imply adjacent datapoints are directly related). 



markers was uncommon in negative nodes; TAG-72 showed the 
most staining, being expressed at only low levels by cells in the 
germinal centres in 5 out of 18 cases. The distributions of 
expression in positive nodes were similar to the patterns seen in 
primary tumours, with prevalent and strong expression of CEA 
and TAG-72, and less frequent and weaker expression of FRa 
and EGFR. 

Specificity and sensitivity of markers for tumour detection. 

Accurate imaging of tumours based on the expression of any 
marker is heavily influenced by the limit of detection of the 
imaging system for marker expression. We have modelled this 
influence for each marker by examining what proportion of 
tumours would be successfully visualised when different expression 
scores were defined as the limit of detection; this is the sensitivity. 
We have also tested what proportion of normal tissues would be 
'invisible', as required for tumour- specific imaging, using the same 
limits; this is the specificity. First, we arbitrarily set this cutoff point 
as the 95th percentile of the normal score distribution, thereby 
tolerating a 5% chance of incorrectly detecting normal tissue as 
positive (a specificity of 95%). This cutoff gave sensitivities of 
93.7%, 45.4%, 39.3% and 11.8% for CEA, TAG-72, FRa and EGFR, 
respectively. We then plotted a ROC curve for each marker 
(Supplementary Figure SI) and used these to select the cutoff 
scores that gave the greatest combined sensitivity and specificity. 
These cutoff scores gave sensitivities and specificities of 93.7% and 
96.1% for CEA, 70.3% and 78.5% for TAG-72, 39.3% and 99.2% for 
FRa, and 21.7% and 90.5% for EGFR. 



DISCUSSION 



In vivo tumour- specific targeting is a central requirement of many 
novel diagnostic and therapeutic strategies being developed for 
cancer. In colorectal cancer, the ability to detect and stage cancers 
intra- operatively has the potential to improve patient outcomes by 
accurate determination of resection margins and assessment of 
disease dissemination. Similarly, tumour-specific delivery of 
cytotoxic drugs or contrast agents for enhanced imaging has the 
potential to improve adjuvant treatment and diagnosis, respec- 
tively. Experimental work to date has relied mainly on antibodies 
directed at cell membrane proteins, such as CEA, TAG-72, FRa or 
EGFR (Reddy and Low, 1998; Muguruma et al y 1999; Zou et al 
2009; Goetz et al, 2010), although the field is advancing with the 
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introduction of aptamer-based approaches. Surprisingly, given the 
dependency on accurate biomarker localisation, the selection of the 
markers used for tumour targeting is rarely justified based on their 
sensitivity or specificity. One would not contemplate using a 
marker giving poor specificity and sensitivity in routine laboratory 
analysis, and the same stringent criteria should therefore apply to 
in vivo tumour identification. 

We examined the expression of four most commonly used 
colorectal cancer biomarkers in a large cohort of matched normal 
and tumour tissue using immunohistochemistry to determine their 
potential suitability for in vivo diagnostic and therapeutic 
application. Markers proving unsuitable in this analysis would 
most likely have no potential utility for the in vivo applications, 
which are even further limited by additional variables, such as 
tissue penetration, immunological response, and expression of 
markers within other normal tissues. To make our assessment we 
investigated three key criteria: (i) the proportion of the cohort in 
which tumours stained positively for markers, (ii) the proportion 
of the cohort in which tumours stained more strongly for markers 
than the matched normal tissues, and (iii) the magnitude of 
overexpression in tumours. Carcinoembryonic antigen out-per- 
formed other markers dramatically in all three measures, showing 
the most frequent tumour expression, and the most frequent and 
greatest tumour overexpression (Figures 2 and 3). In particular, 
differential CEA expression within normal/tumour pairs was 
notably greater as compared with other markers, providing 
evidence that CEA is the most reliable marker for differentiating 
between normal and tumour tissue. In the context of lymph nodes, 
CEA was again the superior target, with high levels and frequency 
of expression within positive nodes, and only minimal expression 
in normal lymph node tissue (Figure 4). 

In practical imaging situations, the threshold, or cutoff, used to 
differentiate between normal and tumour would be determined by 
the user, by adjusting the signal gain. This could be altered to 
favour either sensitivity (lowering the cutoff to detect tumours 
more easily at the risk of detecting normal tissue erroneously) or 
specificity (raising the cutoff to avoid detection of normal tissue at 
the risk of missing tumour cells). We quantified the abilities of the 
markers to be used in tumour cell detection using their optimal 
cutoffs, as determined using ROC analyses, in terms of sensitivity 
(proportion of tumours correctly detected) and specificity 
(proportion of normal tissues correctly not detected). Carcinoem- 
bryonic antigen was the most sensitive marker by a considerable 
margin (93.7%, as compared with the nearest alternative TAG-72 
at 70.3%), and was also highly specific (96.1%). Folate receptor-a 
was the most specific marker (99.2%); however, its utility is limited 
by the fact that it was not detectable in the majority of tumours, as 
previously reported (Shia et al, 2008), resulting in a poor sensitivity 
(39.3%). In addition, the magnitude of differential expression 
between normal and tumour was relatively narrow, providing less 
security in determining tumour or normal identities. Most notably, 
EGFR was a surprisingly poor marker, being detected in less than 
half the tumours and being commonly more highly expressed in 
normal tissues than in tumours. For applications aiming at targeted 
delivery of therapeutics, similar considerations apply. Markers 
must allow a balance between ensuring tumour cells receive the 
cytotoxic insult while minimising collateral damage in normal cells 
that may express the antigen. In this context, FRa may be an 
appropriate target in the tumours that express it, with very high 
specificity allowing minimal damage to normal cells. However, 
CEA is more suitable in a much larger proportion of tumours, 
accepting that normal tissues may suffer some dose of drug, albeit a 
much lower dose than the tumour cells. 

We therefore conclude that of the four biomarkers tested, CEA 
alone is the most suitable for tumour targeting, and this should be 
reflected in future strategies for in vivo tumour diagnosis and 
therapeutic delivery. 
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